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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 




(i) APPLICANT: Amara, Susan G 

Arriza, Jeffrey L 

(ii) TITLE OF INVENTION: Amino Acid Transporters ar 

(iii) NUMBER OF SEQUENCES: 17 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Allegretti & Witcoff, Lt 

(B) STREET: 10 South Wacker Drive, Suj/te 3000 

(C) CITY: Chicago 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) ZIP: 60606 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatilile 

(C) OPERATING SYSTEM: PC-D^/MS-DOS 

(D) SOFTWARE: Patentin ReJ/fease #1.0, 



Uses 



Version #1.25 



(vi) CURRENT APPLICATION DATA^ 
(A) 
(B) 

(C) CLASSIFICATION : 



APPLICATION NUMBER/ US 
FILING DATE: ^ 



(viii) ATTORNEY/AGENT INFTOMATION: 

(A) NAME: Noonan/ Kevin E 

(B) REGISTRATIOT NUMBER: 35,303 

(C) REFERENCE/^CKET NUMBER: 93,509 

(ix) TELECOMMUNICOTION INFORMATION: 

(A) TELEPHGWE: 312-715-1000 

(B) TELEf/x: 312-715-1234 

(C) tele/: 910-221-5317 



(2) INFORMATIOiy FOR SEQ ID NOrl: 

(i) SEQUENCE CHARACTERISTICS: 
(a/ LENGTH: 63 base pairs 
{w) TYPE: nucleic acid 
ifC) STRANDEDNESS: single 
^(D) TOPOLOGY: linear 

(ii)/ MOLECULE TYPE: cDNA 



ALLEGRETTI & WITCOF/. Ltd. 
10 SOUTH WACKER ffRIVE 
CHICAGO. ILLINOIS/0606 
TELEPHONE (312) 7/5-1000 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(1) APPLICANT: Amara, Susan G 

Arriza, Jeffrey L 

(ii) TITLE OF INVENTION: Amino Acid Transporters and Uses 

(iii) NUMBER OF SEQUENCES: 17 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Allegretti & Witcoff, Ltd. 

(B) STREET: 10 South Wacker Drive, Suite 3000 

(C) CITY: Chicago 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) ZIP: 60606 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/140,729 

(B) FILING DATE: 20 OCT 1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Noonan, Kevin E 

(B) REGISTRATION NUMBER: 35,303 

(C) REFERENCE/DOCKET NUMBER: 93,509 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312-715-1000 

(B) TELEFAX: 312-715-1234 

(C) TELEX: 910-221-5317 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
CTGRGCRATG AARATGGCAG CCAGGGCYTC ATACAGGGCT GTGCCRTCCA TGTTRATGGT 
RGC 

(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1..30 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 31.. 1626 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1626.. 1680 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CACCTCTAGC TCGGAGCGGC GTGTAGCGCC ATG GAG AAG AGC AAC GAG ACC AAC 54 

Met Glu Lys Ser Asn Glu Thr Asn 
1 5 

GGC TAC CTT GAC AGC GCT CAG GCG GGG CCT GCG GCC GGG CCC GGA GCT 102 
Gly Tyr Leu Asp Ser Ala Gin Ala Gly Pro Ala Ala Gly Pro Gly Ala 
10 15 20 

CCG GGG ACC GCG GCG GGA CGC GCA CGG CGT TGC GCG CGC TTC CTG CGG 150 
Pro Gly Thr Ala Ala Gly Arg Ala Arg Arg Cys Ala Arg Phe Leu Arg 
25 30 35 40 

CGC CAA GCG CTG GTG CTG CTC ACC GTG TCC GGG GTG CTG GCG GGC GCG 198 
Arg Gin Ala Leu Val Leu Leu Thr Val Ser Gly Val Leu Ala Gly Ala 

45 50 55 

GGC CTG GGC GCG GCG TTG CGC GGG CTC AGC CTG AGC CGC ACG CAG GTC 246 
Gly Leu Gly Ala Ala Leu Arg Gly Leu Ser Leu Ser Arg Thr Gin Val 
60 65 70 

ACC TAC CTG GCC TTC CCC GGC GAG ATG CTG CTC CGC ATG CTG CGC ATG 294 
Thr Tyr Leu Ala Phe Pro Gly Glu Met Leu Leu Arg Met Leu Arg Met 
75 80 85 

ATC ATC CTG CCG CTG GTG GTC TGC AGC CTG GTG TCG GGC GCC GCC TCG 342 
lie lie Leu Pro Leu Val Val Cys Ser Leu Val Ser Gly Ala Ala Ser 
90 95 100 

CTC GAT GCC AGC TGC CTC GGG CGT CTG GGC GGC ATC CGT GTC GCC TAC 390 
Leu Asp Ala Ser Cys Leu Gly Arg Leu Gly Gly lie Arg Val Ala Tyr 
105 110 115 120 

TTT GGC CTC ACC ACA CTG AGT GCC TCG GCG CTC GCC GTG GCC TTG GCG 438 
Phe Gly Leu Thr Thr Leu Ser Ala Ser Ala Leu Ala Val Ala Leu Ala 

125 130 135 



TTC ATC ATC AAG CCA GGA TCC GOT GCG CAG ACC CTT CAG TCC AGC GAC 486 
Phfe lie* lie Lys Pro Gly Ser Gly Ala Gin Thr Leu Gin Ser Ser Asp 
140 145 150 

CTG GGG CTG GAG GAC TCG GGG CCT CCT CCT GTC CCC AAA GAG ACG GTG 534 
Leu Gly Leu Glu Asp Ser Gly Pro Pro Pro Val Pro Lys Glu Thr Val 
155 160 165 

GAC TCT TTC CTC GAC CTG GCC AGA AAC CTG TTT CCC TCC AAT CTT GTG 582 
Asp Ser Phe Leu Asp Leu Ala Arg Asn Leu Phe Pro Ser Asn Leu Val 
170 175 180 

GTT GCA GCT TTC CGT ACG TAT GCA ACC GAT TAT AAA GTC GTG ACC CAG 630 
Val Ala Ala Phe Arg Thr Tyr Ala Thr Asp Tyr Lys Val Val Thr Gin 
185 190 195 200 

AAC AGC AGC TCT GGA AAT GTA ACC CAT GAA AAG ATC CCC ATA GGC ACT 678 
Asn Ser Ser Ser Gly Asn Val Thr His Glu Lys lie Pro lie Gly Thr 

205 210 215 

GAG ATA GAA GGG ATG AAC ATT TTA GGA TTG GTC CTG TTT GCT CTG GTG 726 
Glu lie Glu Gly Met Asn lie Leu Gly Leu Val Leu Phe Ala Leu Val 
220 225 230 

TTA GGA GTG GCC TTA AAG AAA CTA GGC TCC GAA GGA GAA GAC CTC ATC 774 
Leu Gly Val Ala Leu Lys Lys Leu Gly Ser Glu Gly Glu Asp Leu lie 
235 240 245 

CGT TTC TTC AAT TCC CTC AAC GAG GCG ACG ATG GTG CTG GTG TCC TGG 822 
Arg Phe Phe Asn Ser Leu Asn Glu Ala Thr Met Val Leu Val Ser Trp 
250 255 260 

ATT ATG TGG TAC GTA CCT GTG GGC ATC ATG TTC CTT GTT GGA AGC AAG 870 
lie Met Trp Tyr Val Pro Val Gly lie Met Phe Leu Val Gly Ser Lys 
265 270 275 280 

ATC GTG GAA ATG AAA GAC ATC ATC GTG CTG GTG ACC AGC CTG GGG AAA 918 
lie Val Glu Met Lys Asp lie lie Val Leu Val Thr Ser Leu Gly Lys 

285 290 295 

TAC ATC TTC GCA TCT ATA TTG GGC CAT GTT ATT CAT GGA GGA ATT GTT 966 
Tyr He Phe Ala Ser He Leu Gly His Val He His Gly Gly He Val 
300 305 310 

CTG CCA CTT ATT TAT TTT GTT TTC ACA CGA AAA AAC CCA TTC AGA TTC 1014 
Leu Pro Leu He Tyr Phe Val Phe Thr Arg Lys Asn Pro Phe Arg Phe 
315 320 325 

CTC CTG GGC CTC CTC GCC CCA TTT GCG ACA GCA TTT GCT ACC TGC TCC 1062 
Leu Leu Gly Leu Leu Ala Pro Phe Ala Thr Ala Phe Ala Thr Cys Ser 
330 335 340 

AGC TCA GCG ACC CTT CCC TCT ATG ATG AAG TGC ATT GAA GAG AAC AAT 1110 
Ser Ser Ala Thr Leu Pro Ser Met Met Lys Cys He Glu Glu Asn Asn 
345 350 355 360 



GGT GTG GAC AAG AGG ATC AGC AGG TTT ATT CTC CCC ATC GGG GCC ACC 1158 
Gly Val Asp Lys Arg He Ser Arg Phe He Leu Pro He Gly Ala Thr 

365 370 375 



GTG AAC ATG GAG GGA GCA GCC ATC TTC GAG TGT 
Val Asn Met Asp Gly Ala Ala lie Phe Gin Cys 
380 385 



GTG GCC GCG GTG TTC 
Val Ala Ala Val Phe 
390 



1206 



ATT GCG CAA CTC AAC AAC ATA GAG CTC AAC GCA 
lie Ala Gin Leu Asn Asn lie Glu Leu Asn Ala 
395 400 



GGA CAG ATT TTC ACC 
Gly Gin lie Phe Thr 
405 



1254 



ATT CTA GTG ACT GCC ACA GCG TCC AGT GTT GGA 
He Leu Val Thr Ala Thr Ala Ser Ser Val Gly 
410 415 



GCA GCA GGC GTG CCA 
Ala Ala Gly Val Pro 
420 



1302 



GCT GGA GGG GTC CTC ACC ATT GCC ATT ATC CTG 
Ala Gly Gly Val Leu Thr He Ala He He Leu 
425 430 435 



GAG GCC ATT GGG 
Glu Ala He Gly 



CTG 
Leu 
440 



1350 



CCT ACT CAT GAC CTG CCT CTG ATC CTG GCT GTG 
Pro Thr His Asp Leu Pro Leu He Leu Ala Val 

445 450 



GAC TGG ATT GTG GAC 
Asp Trp He Val Asp 
455 



1398 



CGG ACC ACC ACG GTG GTG AAT GTG GAG GGG GAT 
Arg Thr Thr Thr Val Val Asn Val Glu Gly Asp 
460 465 



GCC CTG GGT GCA GGC 
Ala Leu Gly Ala Gly 
470 



1446 



ATT CTC CAC CAC CTG AAT CAG AAG GCA ACA AAG 
He Leu His His Leu Asn Gin Lys Ala Thr Lys 
475 480 



AAA GGC GAG CAG GAA 
Lys Gly Glu Gin Glu 
485 



1494 



CTT GCT GAG GTG AAA GTG GAA GCC ATC CCC AAC 
Leu Ala Glu Val Lys Val Glu Ala He Pro Asn 
490 495 



TGC AAG TCT GAG GAG 
Cys Lys Ser Glu Glu 
500 



1542 



GAG ACA TCG CCC CTG GTG ACA CAC CAG AAC CCC 
Glu Thr Ser Pro Leu Val Thr His Gin Asn Pro 
505 510 515 



GCT GGC CCC GTG 
Ala Gly Pro Val 



GCC 
Ala 
520 



1590 



AGT GCC CCA GAA CTG GAA TCC AAG GAG TCG GTT 
Ser Ala Pro Glu Leu Glu Ser Lys Glu Ser Val 

525 530 



CTG TGATGGGGCT 
Leu 



1636 



GGGCTTTGGG CTTGCCTGCC AGCAGTGATG TCCCACCCTG TTCA 



1680 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 532 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Glu Lys Ser Asn Glu Thr Asn Gly Tyr Leu Asp Ser Ala Gin Ala 
15 10 15 

Gly Pro Ala Ala Gly Pro Gly Ala Pro Gly Thr Ala Ala Gly Arg Ala 
20 25 30 



Arg Arg Cys Ala 
35 

Val Ser Gly Val 
50 

Leu Ser Leu Ser 
65 

Met Leu Leu Arg 



Ser Leu Val Ser 
100 

Leu Gly Gly lie 
115 

Ser Ala Leu Ala 
130 

Ala Gin Thr Leu 
145 

Pro Pro Val Pro 



Asn Leu Phe Pro 
180 

Thr Asp Tyr Lys 
195 

His Glu Lys lie 
210 

Gly Leu Val Leu 
225 

Gly Ser Glu Gly 



Ala Thr Met Val 
260 

He Met Phe Leu 
275 

Val Leu Val Thr 
290 

His Val He His 
305 

Thr Arg Lys Asn 



Ala Thr Ala Phe 
340 



Arg Phe Leu Arg 
40 

Leu Ala Gly Ala 

55 

Arg Thr Gin Val 
70 

Met Leu Arg Met 
85 

Gly Ala Ala Ser 



Arg Val Ala Tyr 
120 

Val Ala Leu Ala 

135 

Gin Ser Ser Asp 
150 

Lys Glu Thr Val 
165 

Ser Asn Leu Val 



Val Val Thr Gin 
200 

Pro He Gly Thr 
215 

Phe Ala Leu Val 
230 

Glu Asp Leu He 
245 

Leu Val Ser Trp 



Val Gly Ser Lys 
280 

Ser Leu Gly Lys 
295 

Gly Gly He Val 
310 

Pro Phe Arg Phe 
325 

Ala Thr Cys Ser 



Arg Gin Ala Leu 



Gly Leu Gly Ala 
60 

Thr Tyr Leu Ala 
75 

He He Leu Pro 
90 

Leu Asp Ala Ser 
105 

Phe Gly Leu Thr 



Phe He He Lys 
140 

Leu Gly Leu Glu 
155 

Asp Ser Phe Leu 
170 

Val Ala Ala Phe 
185 

Asn Ser Ser Ser 



Glu He Glu Gly 
220 

Leu Gly Val Ala 
235 

Arg Phe Phe Asn 
250 

He Met Trp Tyr 
265 

He Val Glu Met 



Tyr He Phe Ala 
300 

Leu Pro Leu He 
315 

Leu Leu Gly Leu 
330 

Ser Ser Ala Thr 
345 



Val Leu Leu Thr 
45 

Ala Leu Arg Gly 



Phe Pro Gly Glu 
80 

Leu Val Val Cys 
95 

Cys Leu Gly Arg 
110 

Thr Leu Ser Ala 
125 

Pro Gly Ser Gly 



Asp Ser Gly Pro 
160 

Asp Leu Ala Arg 
175 , 

Arg Thr Tyr Ala 
190 

Gly Asn Val Thr 
205 

Met Asn He Leu 



Leu Lys Lys Leu 
240 

Ser Leu Asn Glu 
255 

Val Pro Val Gly 
270 

Lys Asp He He 
285 

Ser He Leu Gly 



Tyr Phe Val Phe 
320 

Leu Ala Pro Phe 
335 

Leu Pro Ser Met 
350 



Met Lys Cys lie Glu Glu Asn Asn Gly Val Asp Lys Arg lie Ser Arg 
, 355 360 365 

Phe lie Leu Pro lie Gly Ala Thr Val Asn Met Asp Gly Ala Ala lie 
370 375 380 

Phe Gin Cys Val Ala Ala Val Phe lie Ala Gin Leu Asn Asn lie Glu 
385 390 395 400 

Leu Asn Ala Gly Gin lie Phe Thr lie Leu Val Thr Ala Thr Ala Ser 

405 410 415 



Ser Val Gly Ala Ala Gly Val Pro Ala Gly Gly Val Leu Thr He Ala 
420 425 430 

He He Leu Glu Ala He Gly Leu Pro Thr His Asp Leu Pro Leu He 
435 440 445 

Leu Ala Val Asp Trp He Val Asp Arg Thr Thr Thr Val Val Asn Val 
450 455 460 

Glu Gly Asp Ala Leu Gly Ala Gly He Leu His His Leu Asn Gin Lys 
465 470 475 480 

Ala Thr Lys Lys Gly Glu Gin Glu Leu Ala Glu Val Lys Val Glu Ala 

485 490 495 

He Pro Asn Cys Lys Ser Glu Glu Glu Thr Ser Pro Leu Val Thr His 
500 505 510 

Gin Asn Pro Ala Gly Pro Val Ala Ser Ala Pro Glu Leu Glu Ser Lys 
515 520 525 

Glu Ser Val Leu 
530 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1..30 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 31.. 1656 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1657.. 1680 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



# 



AAAGAAGAGA CCCTCCTAGA AAAGTAAAAT ATG ACT AAA AGC AAT GGA GAA GAG 54 

Met Thr Lys Ser Asn Gly Glu Glu 
1 5 

CCC AAG ATG GGG GGC AGG ATG GAG AGA TTC CAG CAG GGA GTC CGT AAA 102 
Pro Lys Met Gly Gly Arg Met Glu Arg Phe Gin Gin Gly Val Arg Lys 
10 15 20 

CGC ACA CTT TTG GCC AAG AAG AAA GTG CAG AAC ATT ACA AAG GAG GTT 150 
Arg Thr Leu Leu Ala Lys Lys Lys Val Gin Asn lie Thr Lys Glu Val 
25 30 35 40 

GTT AAA AGT TAG CTG TTT CGG AAT GCT TTT GTG CTG CTC ACA GTC ACC 198 
Val Lys Ser Tyr Leu Phe Arg Asn Ala Phe Val Leu Leu Thr Val Thr 

45 50 55 

GCT GTC ATT GTG GGT ACA ATC CTT GGA TTT ACC CTC CGA CCA TAC AGA 246 
Ala Val lie Val Gly Thr lie Leu Gly Phe Thr Leu Arg Pro Tyr Arg 
60 65 70 

ATG AGC TAC CGG GAA GTC AAG TAC TTC TCC TTT CCT GGG GAA CTT CTG 294 
Met Ser Tyr Arg Glu Val Lys Tyr Phe Ser Phe Pro Gly Glu Leu Leu 
75 80 85 

ATG AGG ATG TTA CAG ATG CTG GTC TTA CCA CTT ATC ATC TCC AGT CTT 342 
Met Arg Met Leu Gin Met Leu Val Leu Pro Leu lie lie Ser Ser Leu 
90 95 100 

GTC ACA GGA ATG GCG GCG CTA GAT AGT AAG GCA TCA GGG AAG TGG GAA 390 
Val Thr Gly Met Ala Ala Leu Asp Ser Lys Ala Ser Gly Lys Trp Glu 
i05 110 115 120 

TGC GGA GCT GTA GTC TAT TAT ATG ACT ACC ACC ATC ATT GCT GTG GTG 438 
Cys Gly Ala Val Val Tyr Tyr Met Thr Thr Thr lie lie Ala Val Val 

125 130 135 

ATT GGC ATA ATC ATT GTC ATC ATC ATC CAT CCT GGG AAG GGC ACA AAG 486 
He Gly He He He Val He He He His Pro Gly Lys Gly Thr Lys 
140 145 150 

GAA AAC ATG CAC AGA GAA GGC AAA ATT GTA CGA GTG ACA GCT GCA GAT 534 
Glu Asn Met His Arg Glu Gly Lys He Val Arg Val Thr Ala Ala Asp 
155 160 165 

GCC TTC CTG GAC TTG ATC AGG AAC ATG TTA AAT CCA AAT CTG GTA GAA 582 
Ala Phe Leu Asp Leu He Arg Asn Met Leu Asn Pro Asn Leu Val Glu 
170 175 180 

GCC TGC TTT AAA CAG TTT AAA ACC AAC TAT GAG AAG AGA AGC TTT AAA 630 
Ala Cys Phe Lys Gin Phe Lys Thr Asn Tyr Glu Lys Arg Ser Phe Lys 
185 190 195 200 

GTG CCC ATC CAG GCC AAC GAA ACG CTT GTG GGT GCT GTG ATA AAC AAT 678 
Val Pro He Gin Ala Asn Glu Thr Leu Val Gly Ala Val He Asn Asn 

205 210 215 

GTG TCT GAG GCC ATG GAG ACT CTT ACC CGA ATC ACA GAG GAG CTG GTC 726 
Val Ser Glu Ala Met Glu Thr Leu Thr Arg He Thr Glu Glu Leu Val 
220 225 230 



CCA GTT CCA GGA TCT GTG AAT GGA GTC AAT GCC CTG GGT CTA GTT GTC 774 
Pro Val Pro Gly Ser Val Asn Gly Val Asn Ala Leu Gly Leu Val Val 
235 240 245 

TTC TCC ATG TGC TTC GGT TTT GTG ATT GGA AAC ATG AAG GAA CAG GGG 822 
Phe Ser Met Cys Phe Gly Phe Val lie Gly Asn Met Lys Glu Gin Gly 
250 255 260 

CAG GCC CTG AGA GAG TTC TTT GAT TCT CTT AAC GAA GCC ATC ATG AGA 870 
Gin Ala Leu Arg Glu Phe Phe Asp Ser Leu Asn Glu Ala lie Met Arg 
265 270 275 280 

CTG GTA GCA GTA ATA ATG TGG TAT GCC CCC GTG GGT ATT CTC TTC CTG 918 
Leu Val Ala Val lie Met Trp Tyr Ala Pro Val Gly lie Leu Phe Leu 

285 290 295 

ATT GCT GGG AAG ATT GTG GAG ATG GAA GAC ATG GGT GTG ATT GGG GGG 966 
lie Ala Gly Lys lie Val Glu Met Glu Asp Met Gly Val lie Gly Gly 
300 305 310 

CAG CTT GCC ATG TAC ACC GTG ACT GTC ATT GTT GGC TTA CTC ATT CAC 1014 
Gin Leu Ala Met Tyr Thr Val Thr Val lie Val Gly Leu Leu lie His 
315 320 325 

GCA GTC ATC GTC TTG CCA CTC CTC TAC TTC TTG GTA ACA CGG AAA AAC 1062 
Ala Val lie Val Leu Pro Leu Leu Tyr Phe Leu Val Thr Arg Lys Asn 
330 335 340 

CCT TGG GTT TTT ATT GGA GGG TTG CTG CAA GCA CTC ATC ACC GCT CTG 1110 
Pro Trp Val Phe lie Gly Gly Leu Leu Gin Ala Leu lie Thr Ala Leu 
345 350 355 360 

GGG ACC TCT TCA AGT TCT GCC ACC CTA CCC ATC ACC TTC AAG TGC CTG 1158 
Gly Thr Ser Ser Ser Ser Ala Thr Leu Pro lie Thr Phe Lys Cys Leu 

365 370 375 

GAA GAG AAC AAT GGC GTG GAC AAG CGC GTC ACC AGA TTC GTG CTC CCC 1206 
Glu Glu Asn Asn Gly Val Asp Lys Arg Val Thr Arg Phe Val Leu Pro 
380 385 390 

GTA GGA GCC ACC ATT AAC ATG GAT GGG ACT GCC CTC TAT GAG GCT TTG 1254 
Val Gly Ala Thr lie Asn Met Asp Gly Thr Ala Leu Tyr Glu Ala Leu 
395 400 405 

GCT GCC ATT TTC ATT GCT CAA GTT AAC AAC TTT GAA CTG AAC TTC GGA 1302 
Ala Ala lie Phe lie Ala Gin Val Asn Asn Phe Glu Leu Asn Phe Gly 
410 415 420 

CAA ATT ATT ACA ATC AGC ATC ACA GCC ACA GCT GCC AGT ATT GGG GCA 1350 
Gin lie lie Thr lie Ser lie Thr Ala Thr Ala Ala Ser lie Gly Ala 
425 430 435 440 

GCT GGA ATT CCT CAG GCG GGC CTG GTC ACT ATG GTC ATT GTG CTG ACA 1398 
Ala Gly lie Pro Gin Ala Gly Leu Val Thr Met Val lie Val Leu Thr 

445 450 455 

TCT GTC GGC CTG CCC ACT GAC GAC ATC ACG CTC ATC ATC GCG GTG GAC 1446 
Ser Val Gly Leu Pro Thr Asp Asp lie Thr Leu lie lie Ala Val Asp 
460 465 470 



TGG TTC TTG GAT CGC CTC CGG ACC ACC ACC AAC GTA CTG GGA GAC TCC 1494 
Trp Phe Leu Asp Arg Leu Arg Thr Thr Thr Asn Val Leu Gly Asp Ser 
475 480 485 

CTG GGA GCT GGG ATT GTG GAG CAC TTG TCA CGA CAT GAA CTG AAG AAC 1542 
Leu Gly Ala Gly lie Val Glu His Leu Ser Arg His Glu Leu Lys Asn 
490 495 500 

AGA GAT GTT GAA ATG GGT AAC TCA GTG ATT GAA GAG AAT GAA ATG AAG 1590 
Arg Asp Val Glu Met Gly Asn Ser Val lie Glu Glu Asn Glu Met Lys 
505 510 515 520 

AAA CCA TAT CAA CTG ATT GCA CAG GAC AAT GAA ACT GAG AAA CCC ATC 1638 
Lys Pro Tyr Gin Leu lie Ala Gin Asp Asn Glu Thr Glu Lys Pro lie 

525 530 535 

GAC AGT GAA ACC AAG ATG TAGACTAACA TAAAGAAACA CTTT 1680 
Asp Ser Glu Thr Lys Met 
540 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 542 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Thr Lys Ser Asn Gly Glu Glu Pro Lys Met Gly Gly Arg Met Glu 
15 10 15 

Arg Phe Gin Gin Gly Val Arg Lys Arg Thr Leu Leu Ala Lys Lys Lys 
20 25 30 

Val Gin Asn lie Thr Lys Glu Val Val Lys Ser Tyr Leu Phe Arg Asn 
35 40 45 

Ala Phe Val Leu Leu Thr Val Thr Ala Val He Val Gly Thr He Leu 
50 55 60 

Gly Phe Thr Leu Arg Pro Tyr Arg Met Ser Tyr Arg Glu Val Lys Tyr 
65 70 75 80 

Phe Ser Phe Pro Gly Glu Leu Leu Met Arg Met Leu Gin Met Leu Val 

85 90 95 

Leu Pro Leu He He Ser Ser Leu Val Thr Gly Met Ala Ala Leu Asp 
100 105 110 

Ser Lys Ala Ser Gly Lys Trp Glu Cys Gly Ala Val Val Tyr Tyr Met 
115 120 125 

Thr Thr Thr He He Ala Val Val He Gly He He He Val He He 
130 135 140 



lie His Pro Gly 
145 

lie Val Arg Val 



Met Leu Asn Pro 
180 

Asn Tyr Glu Lys 
195 

Leu Val Gly Ala 
210 

Thr Arg lie Thr 
225 

Val Asn Ala Leu 



lie Gly Asn Met 
260 

Ser Leu Asn Glu 
275 

Ala Pro Val Gly 
290 

Glu Asp Met Gly 

305 



Val lie Val Gly 



Tyr Phe Leu Val 
340 

Leu Gin Ala Leu 
355 

Leu Pro lie Thr 
370 

Arg Val Thr Arg 
385 

Gly Thr Ala Leu 



Asn Asn Phe Glu 
420 

Ala Thr Ala Ala 
435 

Val Thr Met Val 
450 



Lys Gly Thr Lys 
150 

Thr Ala Ala Asp 
165 

Asn Leu Val Glu 



Arg Ser Phe Lys 
200 

Val lie Asn Asn 
215 

Glu Glu Leu Val 
230 

Gly Leu Val Val 
245 

Lys Glu Gin Gly 



Ala lie Met Arg 
280 

lie Leu Phe Leu 
295 

Val He Gly Gly 
310 



Leu Leu He His 
325 

Thr Arg Lys Asn 



He Thr Ala Leu 
360 

Phe Lys Cys Leu 
375 

Phe Val Leu Pro 
390 

Tyr Glu Ala Leu 
405 

Leu Asn Phe Gly 



Ser He Gly Ala 
440 

He Val Leu Thr 
455 



Glu Asn Met His 
155 

Ala Phe Leu Asp 
170 

Ala Cys Phe Lys 
185 

Val Pro He Gin 



Val Ser Glu Ala 
220 

Pro Val Pro Gly 
235 

Phe Ser Met Cys 
250 

Gin Ala Leu Arg 
265 

Leu Val Ala Val 



He Ala Gly Lys 
300 

Gin Leu Ala Met 
315 



Ala Val He Val 
330 

Pro Trp Val Phe 
345 

Gly Thr Ser Ser 



Glu Glu Asn Asn 
380 

Val Gly Ala Thr 
395 

Ala Ala He Phe 
410 

Gin He He Thr 
425 

Ala Gly He Pro 



Ser Val Gly Leu 
460 



Arg Glu Gly Lys 
160 

Leu He Arg Asn 
175 

Gin Phe Lys Thr 
190 

Ala Asn Glu Thr 
205 

Met Glu Thr Leu 



Ser Val Asn Gly 
240 

Phe Gly Phe Val 
255 

Glu Phe Phe Asp 
270 

He Met Trp Tyr 
285 

He Val Glu Met 



Tyr Thr Val Thr 
320 



Leu Pro Leu Leu 
335 

He Gly Gly Leu 
350 

Ser Ser Ala Thr 
365 

Gly Val Asp Lys 



He Asn Met Asp 
400 

He Ala Gin Val 
415 

He Ser He Thr 
430 

Gin Ala Gly Leu 
445 

Pro Thr Asp Asp 



life Thr Leu lie lie 
465 

Thr Thr Asn Val Leu 

485 

Leu Ser Arg His Glu 
500 

Val He Glu Glu Asn 
515 

Asp Asn Glu Thr Glu 

530 



Ala Val Asp Trp Phe Leu 
470 475 

Gly Asp Ser Leu Gly Ala 

490 

Leu Lys Asn Arg Asp Val 
505 

Glu Met Lys Lys Pro Tyr 
520 

Lys Pro He Asp Ser Glu 

535 



Asp Arg Leu Arg Thr 

480 

Gly He Val Glu His 
495 

Glu Met Gly Asn Ser 
510 

Gin Leu He Ala Gin 
525 

Thr Lys Met 
540 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1..33 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 34.. 1755 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1756.. 1800 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GATAGTGCTG AAGAGGAGGG GCGTTCCCAG ACC ATG GCA TCT ACG GAA GGT GCC 54 

Met Ala Ser Thr Glu Gly Ala 
1 5 

AAC AAT ATG CCC AAG CAG GTG GAA GTG CGA ATG CCA GAC AGT CAT CTT 102 
Asn Asn Met Pro Lys Gin Val Glu Val Arg Met Pro Asp Ser His Leu 
10 15 20 

GGC TCA GAG GAA CCC AAG CAC CGG CAC CTG GGC CTG CGC CTG TGT GAC 150 
Gly Ser Glu Glu Pro Lys His Arg His Leu Gly Leu Arg Leu Cys Asp 
25 30 35 

AAG CTG GGG AAG AAT CTG CTG CTC ACC CTG ACG GTG TTT GGT GTC ATC 198 
Lys Leu Gly Lys Asn Leu Leu Leu Thr Leu Thr Val Phe Gly Val He 
40 45 50 55 

CTG GGA GCA GTG TGT GGA GGG CTT CTT CGC TTG GCA TCT CCC ATC CAC 246 
Leu Gly Ala Val Cys Gly Gly Leu Leu Arg Leu Ala Ser Pro He His 

60 65 70 



CCT GAT GTG GTT ATG TTA ATA GCC TTC CCA GGG GAT ATA CTC ATG AGG 294 
Pro Asp Val Val Met Leu lie Ala Phe Pro Gly Asp lie Leu Met Arg 
75 80 85 

ATG CTA AAA ATG CTC ATT CTG GGT CTA ATC ATC TCC AGC TTA ATC ACA 342 
Met Leu Lys Met Leu lie Leu Gly Leu lie lie Ser Ser Leu lie Thr 
90 95 100 

GGG TTG TCA GGC CTG GAT GCT AAG GCT AGT GGC CGC TTG GGC ACG AGA 390 
Gly Leu Ser Gly Leu Asp Ala Lys Ala Ser Gly Arg Leu Gly Thr Arg 
105 110 115 

GCC ATG GTG TAT TAC ATG TCC ACG ACC ATC ATT GCT GCA GTA CTG GGG 438 
Ala Met Val Tyr Tyr Met Ser Thr Thr He He Ala Ala Val Leu Gly 
120 125 130 135 

GTC ATT CTG GTC TTG GCT ATC CAT CCA GGC AAT CCC AAG CTC AAG AAG 486 
Val He Leu Val Leu Ala He His Pro Gly Asn Pro Lys Leu Lys Lys 

140 145 150 

CAG CTG GGG CCT GGG AAG AAG AAT GAT GAA GTG TCC AGC CTG GAT GCC 534 
Gin Leu Gly Pro Gly Lys Lys Asn Asp Glu Val Ser Ser Leu Asp Ala 
155 160 165 

TTC CTG GAC CTT ATT CGA AAT CTC TTC CCT GAA AAC CTT GTC CAA GCC 582 
Phe Leu Asp Leu He Arg Asn Leu Phe Pro Glu Asn Leu Val Gin Ala 
170 175 180 

TGC TTT CAA CAG ATT CAA ACA GTG ACG AAG AAA GTC CTG GTT GCA CCA 630 
Cys Phe Gin Gin He Gin Thr Val Thr Lys Lys Val Leu Val Ala Pro 
185 190 195 

CCG CCA GAC GAG GAG GCC AAC GCA ACC AGC GCT GAA GTC TCT CTG TTG 678 
Pro Pro Asp Glu Glu Ala Asn Ala Thr Ser Ala Glu Val Ser Leu Leu 
200 205 210 215 

AAC GAG ACT GTG ACT GAG GTG CCG GAG GAG ACT AAG ATG GTT ATC AAG 726 
Asn Glu Thr Val Thr Glu Val Pro Glu Glu Thr Lys Met Val He Lys 

220 225 230 

AAG GGC CTG GAG TTC AAG GAT GGG ATG AAC GTC TTA GGT CTG ATA GGG 774 
Lys Gly Leu Glu Phe Lys Asp Gly Met Asn Val Leu Gly Leu He Gly 
235 240 245 

TTT TTC ATT GCT TTT GGC ATC GCT ATG GGG AAG ATG GGA GAT CAG GCC 822 
Phe Phe He Ala Phe Gly He Ala Met Gly Lys Met Gly Asp Gin Ala 
250 255 260 

AAG CTG ATG GTG GAT TTC TTC AAC ATT TTG AAT GAG ATT GTA ATG AAG 870 
Lys Leu Met Val Asp Phe Phe Asn He Leu Asn Glu He Val Met Lys 
265 270 275 

TTA GTG ATC ATG ATC ATG TGG TAC TCT CCC CTG GGT ATC GCC TGC CTG 918 
Leu Val He Met He Met Trp Tyr Ser Pro Leu Gly He Ala Cys Leu 
280 285 290 295 

ATC TGT GGA AAG ATC ATT GCA ATC AAG GAC TTA GAA GTG GTT GCT AGG 966 
He Cys Gly Lys He He Ala He Lys Asp Leu Glu Val Val Ala Arg 

300 305 310 



CAA CTG GGG ATG TAC ATG GTA ACA GTG ATC ATA GGC CTC ATC ATC CAC 1014 
Gin Leu Gly Met Tyr Met Val Thr Val lie lie Gly Leu lie lie His 
315 320 325 

GGG GGC ATC TTT CTC CCC TTG ATT TAC TTT GTA GTG ACC AGG AAA AAC 1062 
Gly Gly lie Phe Leu Pro Leu lie Tyr Phe Val Val Thr Arg Lys Asn 
330 335 340 

CCC TTC TCC CTT TTT GCT GGC ATT TTC CAA GCT TGG ATC ACT GCC CTG 1110 
Pro Phe Ser Leu Phe Ala Gly lie Phe Gin Ala Trp lie Thr Ala Leu 
345 350 355 

GGC ACC GCT TCC AGT GCT GGA ACT TTG CCT GTC ACC TTT CGT TGC CTG 1158 
Gly Thr Ala Ser Ser Ala Gly Thr Leu Pro Val Thr Phe Arg Cys Leu 
360 365 370 375 

GAA GAA AAT CTG GGG ATT GAT AAG CGT GTG ACT AGA TTC GTC CTT CCT 1206 
Glu Glu Asn Leu Gly lie Asp Lys Arg Val Thr Arg Phe Val Leu Pro 

380 385 390 

GTT GGA GCA ACC ATT AAC ATG GAT GGT ACA GCC CTT TAT GAA GCG GTG 1254 
Val Gly Ala Thr lie Asn Met Asp Gly Thr Ala Leu Tyr Glu Ala Val 
395 400 405 

GCC GCC ATC TTT ATA GCC CAA ATG AAT GGT GTT GTC CTG GAT GGA GGA 1302 
Ala Ala He Phe He Ala Gin Met Asn Gly Val Val Leu Asp Gly Gly 
410 415 420 

CAG ATT GTG ACT GTA AGC CTC ACA GCC ACC CTG GCA AGC GTC GGC GCG 1350 
Gin He Val Thr Val Ser Leu Thr Ala Thr Leu Ala Ser Val Gly Ala 
425 430 435 

GCC AGT ATC CCC AGT GCC GGG CTG GTC ACC ATG CTC CTC ATT CTG ACA 1398 
Ala Ser He Pro Ser Ala Gly Leu Val Thr Met Leu Leu He Leu Thr 
440 445 450 455 

GCC GTG GGC CTG CCA ACA GAG GAC ATC AGC TTG CTG GTG GCT GTG GAC 1446 
Ala Val Gly Leu Pro Thr Glu Asp He Ser Leu Leu Val Ala Val Asp 

460 465 470 

TGG CTG CTG GAC AGG ATG AGA ACT TCA GTC AAT GTT GTG GGT GAC TCT 1494 
Trp Leu Leu Asp Arg Met Arg Thr Ser Val Asn Val Val Gly Asp Ser 
475 480 485 

TTT GGG GCT GGG ATA GTC TAT CAC CTC TCC AAG TCT GAG CTG GAT ACC 1542 
Phe Gly Ala Gly He Val Tyr His Leu Ser Lys Ser Glu Leu Asp Thr 
490 495 500 

ATT GAC TCC CAG CAT CGA GTG CAT GAA GAT ATT GAA ATG ACC AAG ACT 1590 
He Asp Ser Gin His Arg Val His Glu Asp He Glu Met Thr Lys Thr 
505 510 515 

CAA TCC ATT TAT GAT GAC ATG AAG AAC CAC AGG GAA AGC AAC TCT AAT 1638 
Gin Ser He Tyr Asp Asp Met Lys Asn His Arg Glu Ser Asn Ser Asn 
520 525 530 535 

CAA TGT GTC TAT GCT GCA CAC AAC TCT GTC ATA GTA GAT GAA TGC AAG 1686 
Gin Cys Val Tyr Ala Ala His Asn Ser Val He Val Asp Glu Cys Lys 

540 545 550 



GTA ACT CTG GCA GCC AAT GGA AAG TCA GCC GAG TGC AGT GTT GAG GAA 1734 
Val Thr Leu Ala Ala Asn Gly Lys Ser Ala Asp Cys Ser Val Glu Glu 
555 560 565 

GAA CCT TGG AAA CGT GAG AAA TAAGGATATG AGTCTCAGCA AATTCTTGAA 1785 
Glu Pro Trp Lys Arg Glu Lys 
570 

TAAACTCCCC AGCGT 1800 

(2) INFORMATION FOR SEQ ID NO: 7: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Ala Ser Thr Glu Gly Ala Asn Asn Met Pro Lys Gin Val Glu Val 
1 5 10 15 

Arg Met Pro Asp Ser His Leu Gly Ser Glu Glu Pro Lys His Arg His 
20 25 30 

Leu Gly Leu Arg Leu Cys Asp Lys Leu Gly Lys Asn Leu Leu Leu Thr 
35 40 45 

Leu Thr Val Phe Gly Val lie Leu Gly Ala Val Cys Gly Gly Leu Leu 
50 55 60 

Arg Leu Ala Ser Pro lie His Pro Asp Val Val Met Leu lie Ala Phe 
65 70 75 80 

Pro Gly Asp lie Leu Met Arg Met Leu Lys Met Leu lie Leu Gly Leu 

85 90 95 

lie lie Ser Ser Leu lie Thr Gly Leu Ser Gly Leu Asp Ala Lys Ala 
100 105 110 

Ser Gly Arg Leu Gly Thr Arg Ala Met Val Tyr Tyr Met Ser Thr Thr 
115 120 125 

lie lie Ala Ala Val Leu Gly Val He Leu Val Leu Ala He His Pro 
130 135 140 

Gly Asn Pro Lys Leu Lys Lys Gin Leu Gly Pro Gly Lys Lys Asn Asp 
145 150 155 160 

Glu Val Ser Ser Leu Asp Ala Phe Leu Asp Leu He Arg Asn Leu Phe 

165 170 175 

Pro Glu Asn Leu Val Gin Ala Cys Phe Gin Gin He Gin Thr Val Thr 
180 185 190 

Lys Lys Val Leu Val Ala Pro Pro Pro Asp Glu Glu Ala Asn Ala Thr 



195 200 205 

Ser Ala Glu Val Ser Leu Leu Asn Glu Thr Val Thr Glu Val Pro Glu 
210 215 220 

Glu Thr Lys Met Val lie Lys Lys Gly Leu Glu Phe Lys Asp Gly Met 
225 230 235 240 

Asn Val Leu Gly Leu lie Gly Phe Phe lie Ala Phe Gly lie Ala Met 

245 250 255 

Gly Lys Met Gly Asp Gin Ala Lys Leu Met Val Asp Phe Phe Asn lie 
260 265 270 

Leu Asn Glu lie Val Met Lys Leu Val lie Met lie Met Trp Tyr Ser 
275 280 285 

Pro Leu Gly lie Ala Cys Leu lie Cys Gly Lys lie lie Ala lie Lys 
290 295 300 

Asp Leu Glu Val Val Ala Arg Gin Leu Gly Met Tyr Met Val Thr Val 
305 310 315 320 

lie lie Gly Leu lie lie His Gly Gly lie Phe Leu Pro Leu lie Tyr 

325 330 335 

Phe Val Val Thr Arg Lys Asn Pro Phe Ser Leu Phe Ala Gly lie Phe 
340 345 350 

Gin Ala Trp lie Thr Ala Leu Gly Thr Ala Ser Ser Ala Gly Thr Leu 
355 360 365 



Pro Val Thr Phe Arg Cys Leu Glu Glu Asn Leu Gly lie Asp Lys Arg 
370 375 380 

Val Thr Arg Phe Val Leu Pro Val Gly Ala Thr lie Asn Met Asp Gly 
385 390 395 400 

Thr Ala Leu Tyr Glu Ala Val Ala Ala lie Phe lie Ala Gin Met Asn 

405 410 415 

Gly Val Val Leu Asp Gly Gly Gin lie Val Thr Val Ser Leu Thr Ala 
420 425 430 

Thr Leu Ala Ser Val Gly Ala Ala Ser lie Pro Ser Ala Gly Leu Val 
435 440 445 

Thr Met Leu Leu lie Leu Thr Ala Val Gly Leu Pro Thr Glu Asp lie 
450 455 460 

Ser Leu Leu Val Ala Val Asp Trp Leu Leu Asp Arg Met Arg Thr Ser 
465 470 475 480 

Val Asn Val Val Gly Asp Ser Phe Gly Ala Gly lie Val Tyr His Leu 

485 490 495 

Ser Lys Ser Glu Leu Asp Thr lie Asp Ser Gin His Arg Val His Glu 
500 505 510 



Asp lie Glu Met Thr Lys Thr Gin Ser lie Tyr Asp Asp Met Lys Asn 
515 520 525 

His Arg Glu Ser Asn Ser Asn Gin Cys Val Tyr Ala Ala His Asn Ser 
530 535 540 

Val lie Val Asp Glu Cys Lys Val Thr Leu Ala Ala Asn Gly Lys Ser 
545 550 555 560 

Ala Asp Cys Ser Val Glu Glu Glu Pro Trp Lys Arg Glu Lys 

565 570 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1674 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECXJLE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1..15 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 16.. 1590 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1591.. 1674 



(xi) SEQUENCE DESCRIPTION: SEQ "ID NO: 8: 

ATAGCGGCGA CAGCC ATG GGG AAA CCG GCG AGG AAA GGA TGC CCG AGT TGG 51 

Met Gly Lys Pro Ala Arg Lys Gly Cys Pro Ser Trp 
15 10 

AAG CGC TTC CTG AAG AAT AAC TGG GTG TTG CTG TCC ACC GTG GCC GCG 99 
Lys Arg Phe Leu Lys Asn Asn Trp Val Leu Leu Ser Thr Val Ala Ala 
15 20 25 

GTG GTG CTA GGC ATT ACC ACA GGA GTC TTG GTT CGA GAA CAC AGC AAC 147 
Val Val Leu Gly lie Thr Thr Gly Val Leu Val Arg Glu His Ser Asn 
30 35 40 

CTC TCA ACT CTA GAG AAA TTC TAC TTT GCT TTT CCT GGA GAA ATT CTA 195 
Leu Ser Thr Leu Glu Lys Phe Tyr Phe Ala Phe Pro Gly Glu lie Leu 
45 50 55 60 

ATG CGG ATG CTG AAA CTC ATC ATT TTG CCA TTA ATT ATA TCC AGC ATG 243 
Met Arg Met Leu Lys Leu lie lie Leu Pro Leu lie lie Ser Ser Met 

65 70 75 



ATT ACA GGT GTT GCT GCA CTG GAT TCC AAC GTA TCC GGA AAA ATT GGT 
lie Thr Gly Val Ala Ala Leu Asp Ser Asn Val Ser Gly Lys lie Gly 
80 85 90 



291 



CTG CGC GCT GTC GTG TAT TAT TTC TGT ACC ACT CTC ATT GCT GTT ATT 339 
Leu Arg Ala Val Val Tyr Tyr Phe Cys Thr Thr Leu lie Ala Val lie 
95 100 105 

CTA GGT ATT GTG CTG GTG GTG AGC ATC AAG CCT GGT GTC ACC CAG AAA 387 
Leu Gly He Val Leu Val Val Ser He Lys Pro Gly Val Thr Gin Lys 
110 115 120 

GTG GGT GAA ATT GCG AGG ACA GGC AGC ACC CCT GAA GTC AGT ACG GTG 435 
Val Gly Glu He Ala Arg Thr Gly Ser Thr Pro Glu Val Ser Thr Val 
125 130 135 140 

GAT GCC ATG TTA GAT CTC ATC AGG AAT ATG TTC CCT GAG AAT CTT GTC 483 
Asp Ala Met Leu Asp Leu He Arg Asn Met Phe Pro Glu Asn Leu Val 

145 150 155 

CAG GCC TGT TTT CAG CAG TAC AAA ACT AAG CGT GAA GAA GTG AAG CCT 531 
Gin Ala Cys Phe Gin Gin Tyr Lys Thr Lys Arg Glu Glu Val Lys Pro 
160 165 170 

CCC AGC GAT CCA GAG ATG AAC ATG ACA GAA GAG TCC TTC ACA GCT GTC 579 
Pro Ser Asp Pro Glu Met Asn Met Thr Glu Glu Ser Phe Thr Ala Val 
175 180 185 

ATG ACA ACT GCA ATT TCC AAG AAC AAA ACA AAG GAA TAC AAA ATT GTT 627 
Met Thr Thr Ala He Ser Lys Asn Lys Thr Lys Glu Tyr Lys He Val 
190 195 200 

GGC ATG TAT TCA GAT GGC ATA AAC GTC CTG GGC TTG ATT GTC TTT TGC 675 
Gly Met Tyr Ser Asp Gly He Asn Val Leu Gly Leu He Val Phe Cys 
205 210 215 220 

CTT GTC TTT GGA CTT GTC ATT GGA AAA ATG GGA GAA AAG GGA CAA ATT 723 
Leu Val Phe Gly Leu Val He Gly Lys Met Gly Glu Lys Gly Gin He 

225 230 235 

CTG GTG GAT TTC TTC AAT GCT TTG AGT GAT GCA ACC ATG AAA ATC GTT 771 
Leu Val Asp Phe Phe Asn Ala Leu Ser Asp Ala Thr Met Lys He Val 
240 245 250 

CAG ATC ATC ATG TGT TAT ATG CCA CTA GGT ATT TTG TTC CTG ATT GCT 819 
Gin He He Met Cys Tyr Met Pro Leu Gly He Leu Phe Leu He Ala 
255 260 265 

GGG AAG ATC ATA GAA GTT GAA GAC TGG GAA ATA TTC CGC AAG CTG GGC 867 
Gly Lys He He Glu Val Glu Asp Trp Glu He Phe Arg Lys Leu Gly 
270 275 280 

CTT TAC ATG GCC ACA GTC CTG ACT GGG CTT GCA ATC CAC TCC ATT GTA 915 
Leu Tyr Met Ala Thr Val Leu Thr Gly Leu Ala He His Ser He Val 
285 290 295 300 

ATT CTC CCG CTG ATA TAT TTC ATA GTC GTA CGA AAG AAC CCT TTC CGA 963 
He Leu Pro Leu He Tyr Phe He Val Val Arg Lys Asn Pro Phe Arg 

305 310 315 

TTT GCC ATG GGA ATG GCC CAG GCT CTC CTG ACA GCT CTC ATG ATC TCT 1011 
Phe Ala Met Gly Met Ala Gin Ala Leu Leu Thr Ala Leu Met He Ser 
320 325 330 



TCC AGT TCA GCA ACA CTG CCT GTC ACC TTC CGC TGT GCT GAA GAA AAT 1059 
Ser Ser Ser Ala Thr Leu Pro Val Thr Phe Arg Cys Ala Glu Glu Asn 
335 340 345 

AAC CAG GTG GAC AAG AGG ATC ACT CGA TTC GTG TTA CCC GTT GGT GCA 1107 
Asn Gin Val Asp Lys Arg lie Thr Arg Phe Val Leu Pro Val Gly Ala 
350 355 360 

ACA ATC AAC ATG GAT GGG ACC GCG CTC TAT GAA GCA GTG GCA GCG GTG 1155 
Thr lie Asn Met Asp Gly Thr Ala Leu Tyr Glu Ala Val Ala Ala Val 
365 370 375 380 

TTT ATT GCA CAG TTG AAT GAC CTG GAC TTG GGC ATT GGG CAG ATC ATC 1203 
Phe lie Ala Gin Leu Asn Asp Leu Asp Leu Gly lie Gly Gin lie lie 

385 390 395 

ACC ATC AGT ATC ACG GCC ACA TCT GCC AGC ATC GGA GCT GCT GGC GTG 1251 
Thr lie Ser lie Thr Ala Thr Ser Ala Ser lie Gly Ala Ala Gly Val 
400 405 410 

CCC CAG GCT GGC CTG GTG ACC ATG GTG ATT GTG CTG AGT GCC GTG GGC 1299 
Pro Gin Ala Gly Leu Val Thr Met Val lie Val Leu Ser Ala Val Gly 
415 420 425 

CTG CCC GCC GAG GAT GTC ACC CTG ATC ATT GCT GTC GAC TGG CTC CTG 1347 
Leu Pro Ala Glu Asp Val Thr Leu lie lie Ala Val Asp Trp Leu Leu 
430 435 440 

GAC CGG TTC AGG ACC ATG GTC AAC GTC CTT GGT GAT GCT TTT GGG ACG 1395 
Asp Arg Phe Arg Thr Met Val Asn Val Leu Gly Asp Ala Phe Gly Thr 
445 450 455 460 

GGC ATT GTG GAA AAG CTC TCC AAG AAG GAG CTG GAG CAG ATG GAT GTT 1443 
Gly lie Val Glu Lys Leu Ser Lys Lys Glu Leu Glu Gin Met Asp Val 

465 470 475 

TCA TCT GAA GTC AAC ATT GTG AAT CCC TTT GCC TTG GAA TCC ACA ATC 1491 
Ser Ser Glu Val Asn lie Val Asn Pro Phe Ala Leu Glu Ser Thr lie 
480 485 490 

CTT GAC AAC GAA GAC TCA GAC ACC AAG AAG TCT TAT GTC AAT GGA GGC 1539 
Leu Asp Asn Glu Asp Ser Asp Thr Lys Lys Ser Tyr Val Asn Gly Gly 
495 500 505 

TTT GCA GTA GAC AAG TCT GAC ACC ATC TCA TTC ACC CAG ACC TCA CAG 1587 
Phe Ala Val Asp Lys Ser Asp Thr lie Ser Phe Thr Gin Thr Ser Gin 
510 515 520 

TTC TAGGGCCCCT GGCTGCAGAT GACTGGAAAC AAGGAAGGAC ATTTCGTGAG 1640 

Phe 

525 

AGTCATCTCA AACACGGCTT AAGGAAAAGA GAAA 1674 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 amino acids 



' . ' (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Gly Lys Pro Ala Arg Lys Gly Cys Pro Ser Trp Lys Arg Phe Leu 
15 10 15 

Lys Asn Asn Trp Val Leu Leu Ser Thr Val Ala Ala Val Val Leu Gly 
20 25 30 

lie Thr Thr Gly Val Leu Val Arg Glu His Ser Asn Leu Ser Thr Leu 
35 40 45 

Glu Lys Phe Tyr Phe Ala Phe Pro Gly Glu lie Leu Met Arg Met Leu 
50 55 60 

Lys Leu lie lie Leu Pro Leu lie lie Ser Ser Met lie Thr Gly Val 
65 70 75 80 

Ala Ala Leu Asp Ser Asn Val Ser Gly Lys lie Gly Leu Arg Ala Val 

85 90 95 

Val Tyr Tyr Phe Cys Thr Thr Leu lie Ala Val lie Leu Gly lie Val 
100 105 110 

Leu Val Val Ser lie Lys Pro Gly Val Thr Gin Lys Val Gly Glu lie 
115 120 125 

Ala Arg Thr Gly Ser Thr Pro Glu Val Ser Thr Val Asp Ala Met Leu 
130 135 140 

Asp Leu lie Arg Asn Met Phe Pro Glu Asn Leu Val Gin Ala Cys Phe 
145 150 155 160 

Gin Gin Tyr Lys Thr Lys Arg Glu Glu Val Lys Pro Pro Ser Asp Pro 

165 170 175 

Glu Met Asn Met Thr Glu Glu Ser Phe Thr Ala Val Met Thr Thr Ala 
180 185 190 

lie Ser Lys Asn Lys Thr Lys Glu Tyr Lys lie Val Gly Met Tyr Ser 
195 200 205 

Asp Gly lie Asn Val Leu Gly Leu lie Val Phe Cys Leu Val Phe Gly 
210 215 220 

Leu Val lie Gly Lys Met Gly Glu Lys Gly Gin lie Leu Val Asp Phe 
225 230 235 240 



Phe Asn Ala Leu Ser Asp Ala Thr Met Lys lie Val Gin lie lie Met 

245 250 255 

Cys Tyr Met Pro Leu Gly lie Leu Phe Leu lie Ala Gly Lys lie lie 
260 265 270 

Glu Val Glu Asp Trp Glu lie Phe Arg Lys Leu Gly Leu Tyr Met Ala 



275 280 285 

Thr Val Leu Thr Gly Leu Ala lie His Ser lie Val lie Leu Pro Leu 
290 295 300 

lie Tyr Phe lie Val Val Arg Lys Asn Pro Phe Arg Phe Ala Met Gly 
305 310 315 320 

Met Ala Gin Ala Leu Leu Thr Ala Leu Met lie Ser Ser Ser Ser Ala 

325 330 335 

Thr Leu Pro Val Thr Phe Arg Cys Ala Glu Glu Asn Asn Gin Val Asp 
340 345 350 

Lys Arg lie Thr Arg Phe Val Leu Pro Val Gly Ala Thr lie Asn Met 
355 360 365 

Asp Gly Thr Ala Leu Tyr Glu Ala Val Ala Ala Val Phe lie Ala Gin 
370 375 380 

Leu Asn Asp Leu Asp Leu Gly lie Gly Gin lie lie Thr lie Ser lie 
385 390 395 400 

Thr Ala Thr Ser Ala Ser lie Gly Ala Ala Gly Val Pro Gin Ala Gly 

405 410 415 

Leu Val Thr Met Val lie Val Leu Ser Ala Val Gly Leu Pro Ala Glu 
420 425 430 

Asp Val Thr Leu lie lie Ala Val Asp Trp Leu Leu Asp Arg Phe Arg 
435 440 445 

Thr Met Val Asn Val Leu Gly Asp Ala Phe Gly Thr Gly He Val Glu 
450 455 460 

Lys Leu Ser Lys Lys Glu Leu Glu Gin Met Asp Val Ser Ser Glu Val 
465 470 475 480 

Asn lie Val Asn Pro Phe Ala Leu Glu Ser Thr lie Leu Asp Asn Glu 

485 490 495 

Asp Ser Asp Thr Lys Lys Ser Tyr Val Asn Gly Gly Phe Ala Val Asp 
500 505 510 



Lys Ser Asp Thr He Ser Phe Thr Gin Thr Ser Gin Phe 
515 520 525 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



CGCGGGTACC GCCATGGAGA AGAGCAAC 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CGCGTCTAGA TCACAGAACC GACTCCTTG 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CGCGGGTACC AATATGACTA AAAGCAATG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CGCGTCTAGA CTACATCTTG GTTTCACTG 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CGCGGGTACC ACCATGGCAT CTACGGAAG 

(2) INFORMATION FOR SEQ ID NO: 15: 



.. (^i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CGCGTCTAGA TTATTTCTCA CGTTTCCAAG 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CGCGGGTACC GCCATGGGGA AACCGGCG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CGCGGGATCC CTAGAACTGT GAGGTCTG 



